(19) 



Europaisches Patentamt 
European Patent Office 
Office europeen des brevets 




(12) 



(43) Date of publication: 

12.03.2003 Bulletin 2003/11 

(21) Application number: 01402301.4 

(22) Date of filing: 05.09.2001 



(H) EP1 292 090 A1 

EUROPEAN PATENT APPLICATION 

(51) Intel/: H04M 3/56, H04M 1/57 



(84) Designated Contracting States: 


• L'Huillier, Nicholas Daniel 


AT BE CH CY DE DK ES Fl FR GB GR IE IT LI LU 


Coignidres 78310 (FR) 


MC NL PT SE TR 


• Charlton, Patricia Mary 


Designated Extension States: 


Bayswater, London W2 5RB (GB) 


AL LT LV MK RO SI 






(74) Representative: Jepsen, Rend Pihl et al 


(71) Applicant: MOTOROLA, INC. 


Motorola, 


Schaumburg, IL 60196 (US) 


European Intellectual Property Section, 




Midpoint, 


(72) Inventors: 


Alengon Link 


• Taib, Ronnie Bernard 


Basingstoke, Hampshire RG21 7PL (GB) 


Marseille 13010 (FR) 





< 

© 
o 

CM 

o> 

CM 

CL 
LU 



(54) Conference calling with speaker identification 



(57) A method of conducting a conference call, com- 
prising: identifying a user (2) of a first communication 
unit (12) speaking during the conference call; transmit- 
ting data related to the identified speaker (2) to other 
communication units (14, 16, 28) being used by other 
users (4, 6, 8) participating in the conference call; the 
communication units (14, 16, 18) receiving the data re- 



lated to the identified speaker (2) and displaying speak- 
er data based on the received data related to the iden- 
tified speaker (2). The speaker (2) may be identified by 
comparing the speech being spoken by the user with a 
voice profile for that user. Also described is a corre- 
sponding communication system, a caller identification 
module (21), and adapted communication units (12, 14, 
16, 18). 
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Description 

Field of the Invention 

[0001] This invention relates to the implementation of 
conference calls in communication systems. 

Background of the Invention 

[0002] Telecommunication systems are conventional- 
ly able to set up and conduct conference calls, i.e. calls 
in which users of more than two communication units, 
such as telephones, mobile telephones, computers, 
participate in a call. 

[0003] It is known to identify the caller at the beginning 
of a telephone call by displaying the caller's telephone 
number and then possibly displaying information about 
the caller, US6020916 describes a kind of videoconfer- 
ence, which displays pictures of all participants in addi- 
tion to their voice. WO-0105136 enables personal data 
(credit-card number, social security number, etc) to be 
sent to the other party during a phone call. WO-0075801 
sends customised advertisements along with conversa- 
tion data. WO-01 03406 allows a picture of the caller to 
be sent at the beginning of a call. 
[0004] However, these known forms of identification, 
developed for simple one-to-one calls, do not alleviate 
a disadvantage that arises with conference calls, as fol- 
lows. A disadvantage with conventional arrangements 
for conference calls involving several speakers is that 
the conversation quickly becomes anonymous, since 
there are difficulties in identifying who is currently speak- 
ing. Most of the time speakers have to identify them- 
selves each time they take the floor. This quickly be- 
comes tedious. Furthermore, when a speaker forgets to 
identify himself/herself, it becomes difficult to know from 
whom the last ideas emanated. 
[0005] In WO0105136, data and voice cannot be sent 
simultaneously - there is a manual switch between 
them. 

[0006] Presentation of the caller phone number, WO- 
0075801, US6020916 and WO-0103406 do not help to 
track the caller in real-time during the conference call. 
More specifically, WO-0075801 uses the speaker profile 
to send him advertisements, not to spread it to the other 
users. 

[0007] Thus, there exists a need in the field of the 
present invention to provide an improved way of con- 
ducting conference calls such that the abovementioned 
disadvantages may be alleviated. 

Statement of Invention 

[0008] In a first aspect, the present invention provides 
a method of conducting a conference call, as claimed in 
claim 1 . 

[0009] In a second aspect, the present invention pro- 
vides a communication system for carrying out a con- 
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ference call, as claimed in claim 11. 
[0010] In a third aspect, the present invention pro- 
vides apparatus for conducting a conference call, as 
claimed in claim 20. 

5 [0011] In a fourth aspect, the present invention pro- 
vides a communication unit for participating in a confer- 
ence call, as claimed in claim 27. 
[0012] In a fifth aspect, the present invention provides 
a storage medium, as claimed in claim 30. 

10 [0013] Further aspects are as claimed in the depend- 
ent claims. 

Brief Description of the Drawings 

15 [0014] Exemplary embodiments of the present inven- 
tion will now be described, with reference to the accom- 
panying drawings, in which: 

FIG. 1 shows part of a communication system in 
20 which the present invention may be embodied; 

FIG. 2 is a flowchart showing process steps per- 
formed to determine the type of communication unit 
and/or the display capability for each communica- 
25 tion unit involved in a conference call; and 

FIG. 3 is a flowchart showing process steps em- 
ployed for identifying speakers and transmitting da- 
ta in an embodiment of the invention. 

30 

Description of Preferred Embodiments 

[0015] FIG, 1 shows part of a communication system 
in which the present invention may be embodied. Users 

35 2, 4, 6 and 8 are employing respective communication 
units 12, 14, 16 and 18 to participate in a conference 
call. The connections for the conference call comprise 
respective communication links 22, 24, 26, 28 from the 
communication units 12, 14, 16, 18 to a public switched 

40 telephone network (PSTN) 30. In addition, a conference 
call control module 20 is also connected to the PSTN 
30, and in operation sets up and controls participation 
in the conference call. 

[0016] The above arrangement corresponds to a con- 
45 ventional conference call arrangement, and may be im- 
plemented in any conventional manner. Also, any stand- 
ard modifications, alternative layouts, etc, may be incor- 
porated. 

[0017] However, in this embodiment the conference 
50 call control module 20 and communication units 12, 14, 
16, 18 are modified to implement improved conference 
call operation, in particular to determine which user is 
speaking and to identify this, and other information, to 
the other users, as will be described in more detail be- 
55 low. In this embodiment the communication units 12, 14, 
16, 18 are so modified by re-programming of their main 
processors, (alternatively modification may be by using 
volatile memories or running applications), and the con- 
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ference call control module 20 is so modified by inclu- 
sion of apparatus, namely a caller identification module 
21, incorporated therein. 

[0018] More generally, apparatus for implementing 
the procedures to be described below may be provided 5 
by adapting conventional apparatus and/or providing 
additional modules. The apparatus may be in the form 
of hardware, firmware, or software, or a combination of 
these. The apparatus may comprise one or more proc- 
essors, for implementing instructions and using data 10 
stored in a storage medium such as a computer disk or 
PROM. The apparatus may be distributed between a 
number of communication system components or units. 
The apparatus may be located with general conference 
call controlling apparatus or separate therefrom. t$ 
[0019] Although in this embodiment each of the com- 
munication units 12, 14, 16, 18 are telephones, alterna- 
tively or additionally any suitable communication unit 
may be employed, for example mobile telephone, land 
mobile radio handset, personal computer, etc. In this 20 
case the communication links are of suitable type, e.g. 
radio link plus cellular radio system infrastructure links 
in the case of a mobile telephone. 
[0020] In summary, the caller identification module 21 
identifies which user is speaking. The way the speaker 25 
recognition may be performed is described in further de- 
tail below. Then, based on this identification, this infor- 
mation about the identified speaker is displayed on the 
communication units of the other users. 
[0021] Such data about the speaker may come either 30 
from a listener database forming part of the caller iden- 
tification module 21 , or be provided by the speaker him- 
self or even by a third party. Thus, a real-time tracking 
of the speakers on each communication unit may be 
provided. 35 
[0022] Optionally, the information to be displayed may 
be in a scalable or hierarchical form. Then, the caller 
identification module 21 determines the display capabil- 
ity of each of the communication units 12, 14, 16, 18and 
provides an amount of data, in order of the scaleable or 40 
hierarchical arrangement, to each communication unit 
that is commensurate with the display capability of the 
respective communication unit. 
[0023] Thus, for example, consider that user 2 is iden- 
tified as the speaker, and his information consists of: hi- 45 
erarchical level 1 - his name; hierarchical level 2 - the 
organisation he is representing; level 3 - his photograph; 
and level 4 - his organisation's logo. 
[0024] Further consider that a display screen of com- 
munication unit 14 is relatively large, that of comnriuni- so 
cation unit 1 6 is mid-size, and that of communication unit 
18 is small. Then, for example, communication unit 14 
may be provided with and display all four levels of data, 
i.e. name, organisation, photograph and logo; commu- 
nication unit 16 may be provided with and display just 55 
the top two levels of data, i.e. name and organisation; 
and communication unit 18 may be provided with and 
display just the top level of data, i.e. the speaker's name. 
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[0025] The following registration process, summa- 
rised in flowchart form in FIG. 2, may be carried out in 
order to determine the type of communication unit and/ 
or the display capability for each communication unit in- 
volved in the conference call. 
[0026] At step s2, a user, e.g. user 6, new to the con- 
ference call, wants to join the conference, and so regis- 
ters to the conference call control module 20 (which is 
serving as a conference steering application), using 
communication unit 16. Communication unit 16 sends a 
description of its technical capabilities/functions to the 
caller identification module 21 of the conference call 
control module 20. In particular the information de- 
scribes the display screen capability (as described 
above), and whether the communication unit 16 is able 
to store other user data. If it is not, then the registration 
process is completed. 

[0027] However, if the communication unit is able to 
store other user data, then the process moves to step 
s6 where the caller identification module 21 sends other 
user data to the communication unit 16. The other user 
data comprises the data that is to be displayed for each 
of the other users in the conference call when that 
speaker is identified as a current speaker. Moreover, 
since in this example the optional hierarchical feature is 
employed, this information will be sent to the level of 
detail commensurate with the display capability of com- 
munication unit 16. 

[0028] Thus, in this example, the name and organisa- 
tion of each of users 2, 4 and 8 is sent to the communi- 
cation unit 16, which stores this data. The data for each 
other user is sent, and stored, referenced to a user ID 
allocated to individual users by the caller identification 
module 21. In this example this is only done for users 
engaged in the conference call, and details are erased 
after the end of the call. However, in other embodiments 
a communication unit may store a number of user IDs 
for users regularly called, with corresponding data, such 
that these may be used as required over the course of 
different conference calls over time. 
[0029] FIG. 3 shows process steps employed for iden- 
tifying speakers and transmitting data in this embodi- 
ment. At step s12 one of the users speaks. 
[0030] At s14 the caller identification module deter- 
mines which user is speaking. In this embodiment this 
is performed by the caller identification module 21 com- 
paring the current speech of the user with speech pro- 
files it holds for each of the users involved in the confer- 
ence call. The speech profiles may be previously ac- 
quired by and stored at the caller identification module 
in any suitable manner. 

[0031] In this embodiment each of the users, on reg- 
istration, have, in addition to the steps shown in FIG. 2, 
also entered a standard portion of speech which the call- 
er identification module 21 has analysed to provide that 
user's speech profile. 

[0032] Another exemplary possibility is for data defin- 
ing the speech profile of a user to be stored in that user's 
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communication unit, and sent to the caller identification 
module 21 as another additional part of the above de- 
scribed registration process. 

[0033] Another possibility is that speech profiles may 
be determined and stored at a separate module or da- 
tabase, referenced against unique user IDs, and the 
caller identification module 21 retrieves the speech pro- 
files as required when it ascertains the ID of a user join- 
ing the conference call. 

[0034] The caller identification module 21 compares 
the user's speech with the various speech profiles of the 
users involved in the conference call. This may be per- 
formed in any conventional manner, using for example 
the techniques disclosed in WO-9944380 and/or WO- 
9900719, the contents of each of which are included 
herein by reference. When a match is found, the caller 
identification module has successfully recognised and 
identified the speaker. 

[0035] Once the caller identification module 21 has 
identified the speaker, then in steps s1 6-s22, it transmits 
data related to the identified speaker to each of the other 
user's communication units. For each such recipient 
communication unit, the following steps are carried out. 
At step s16, the caller identification module 21 deter- 
mines whether the respective communication unit sup- 
ports the above described data storage. If it does, then 
at step s18 the caller identification module 21 simply 
sends the user ID of the speaker to the communication 
unit, which then cross-references the user ID to its 
stored other user data in order to determine the speaker- 
related data to be displayed. 

[0036] If, however, the respective recipient communi- 
cation unit does not support the above described data 
storage, then at step s20 the caller identification module 
21 formats the speaker-related data in a form appropri- 
ate for the respective recipient communication unit. At 
step s22 the caller identification module 21 sends that 
data to the recipient communication unit. 
[0037] (Steps s1 6-s22 are repeated for each recipient 
communication unit, i.e. for each user in the conference 
call apart from the speaker. Optionally, the data can 
even be sent to the speaker, for verification purpose 
and/or in the event that a log is being recorded and 
stores at the communication unit of all the speakers 
identified, e.g. for transcript or other purpose). 
[0038] When the identified speaker stops talking, at 
step s24, the cailer identification module 21 determines 
whether there is a new speaker. If so, then the process 
is returned to step s14 and repeated for the new speak- 
er. If there is no further speaker, then the process is com- 
pleted. 

[0039] In a further embodiment, a user may optionally 
add his own input to the data that is to be displayed for 
each speaker. For example, user 8 may enter into his 
communication unit 18 the additional data that user 2 is 
from a rival organisation whereas user 4 is from an or- 
ganisation that is in partnership with his own organisa- 
tion. 



[0040] Generally, in all the above embodiments, the 
data to be displayed may be in any form of words, pic- 
tures, symbols etc. as required. 
[0041] In the above embodiments the speaker is iden- 

5 tified by comparing the speech input with speech pro- 
files. Other means for identifying the speaker may be 
employed. For example, the telephone number of the 
speaker may be identified, although this has a disad- 
vantage that only one speaker per communication unit 

10 may be accommodated, whereas the above embodi- 
ments may accommodate more than one user per com- 
munication unit. 

[0042] In further embodiments, the users may be rep- 
resented by autonomous entities that own data about 
15 them (the voice profile, the information details, the de- 
vice capabilities, etc.). 

[0043] Software agents are well suited for this task, 
and can be used by the user to ask to join a conference. 
The agent can then contact one of the agents already 
20 in the conference, or possibly organise the conference 
itself if it does not exist. 

[0044] Some agents representing initiator partici- 
. pants may create a "conference agent" whose role is to 
manage the conference and to register newcomers. The 

25 newcomers can send their details to this agent that will 
then integrate them in the above processes. 
[0045] Suitable types of software agents are de- 
scribed in the Applicant's co-pending patent application 
GB0020981 .7, filed 25 August 2000, Applicant's refer- 

30 ence CE00315UM, the contents of which are hereby in- 
corporated by reference. 

[0046] It will be understood that the above described 
embodiments tend to provide the following advantages: 

35 (j) it is possible to track the identity of a speaker in- 
volved a conference call, in real-time, without hu- 
man intervention and to display an output on other 
participants' devices; 

40 (ii) the output provides data about the speaker that 
are relevant to the listener; 

(iii) the output can be picture, text or even multime- 
dia; and 

45 

(iv) the device capability and the bandwidth it uses 
are optimised since the data sent to each device are 
formatted precisely for the device. 

50 [0047] Thus, an improved way of conducting confer- 
ence calls has been provided that at least alleviates 
some of the aforementioned disadvantages associated 
with prior art arrangements. 

55 

Claims 

1. A method of conducting a conference call, compris- 
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ing the step of: 



ence call, comprising: 



identifying a user (2,4,6,8) of a first communi- 
cation unit (12,14,16,18) speaking during the 
conference call; 5 

the method characterised by the step of: 

transmitting data related to the identified 
speaker to other communication units 10 
(12,14,16,18) being used by other users 
(2,4,6,8) participating in the conference call. 

A method according to claim 1, further comprising 
the step of: . 15 



means (21 ) for identifying a user of a first com- 
munication unit (12,14,16,18) speaking during 
the conference call; characterised by: 

means for transmitting data related to the 
identified speaker to other communication 
units (12,14,16,18) being used by other us- 
ers participating in the conference call; and 
means for one or more of the communica- 
tion units (12,14,16,18) to receive the data 
related to the identified speaker and to dis- 
play speaker data based on the received 
data related to the identified speaker. 



receiving, by one or more of the communication 
units (1 2, 1 4, 1 6, 1 8), the data related to the iden- 
tified speaker and displaying speaker data 
based on the received data related to the iden- 
tified speaker. 

3. A method according to claim 1 , wherein the identi- 
fied speaker is identified by comparing the speech 
being spoken by the user with a voice profile for that 
user. 

4. A method according to claim 3, wherein the voice 
profile is determined during the conference call. 

5. A method according to claim 3, wherein the voice 
profile is determined and stored in advance of the 
conference call. 

6. A method according to any of claims 2 to 5, wherein 
the displayed speaker data comprises information 
provided by the speaker. 

7. A method according to any of claims 2 to 6, wherein 
the displayed speaker data comprises information 
allocated to the particular speaker by the other user. 

8. A method according to any preceding claim, where- 
in for each respective other user communication 
unit, the data related to the identified speaker com- 
prises data selected from a hierarchy of data relat- 
ing to the identified speaker according to a data dis- 
playing capability of respective other user's commu- 
nication unit. 

9. A method according to any preceding claim, where- 
in the users are represented by autonomous enti- 
ties that own data about the respective users. 

10. A method according to claim 9, wherein software 
agents are employed. 

11. A communication system for carrying out a confer- 



12. A system according to claim 11 , wherein the identi- 
fied speaker is identified by comparing the speech 
being spoken by the user with a voice profile for that 

20 user. 

13. A system according to claim 12, wherein the voice 
profile is determined during the conference call. 



25 



14. A system according to claim 12, wherein the voice 
profile is determined and stored in advance of the 
conference call. 



15. 



30 



A system according to any of claims 11 to 1 4, where- 
in the displayed speaker data comprises informa- 
tion provided by the speaker. 



35 



40 



45 



50 



55 



16. A system according to any of claims 11 to 15, where- 
in the displayed speaker data comprises informa- 
tion allocated to the particular speaker by the recip- 
ient other user. 

17. A system according to any of claims 11 to 16, where- 
in for each respective other user communication 
unit, the data related to the identified speaker com- 
prises data selected from a hierarchy of data relat- 
ing to the identified speaker according to a data dis- 
playing capability of respective other user's commu- 
nication unit. 

1 8. A system according to any of claims 1 1 to 1 7, further 
comprising means for representing the users by au- 
tonomous entities that own data about the respec- 
tive users. 

19. A system according to claim 18, comprising means 
for implementing software agents. 

20. Apparatus for conducting a conference call, com- 
prising: 

means (21) for identifying a user (2,4,6,8) of a 
first communication unit (12,14,16,18) speak- 
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ing during the conference call; 
the apparatus characterised by: 



means for transmitting data related to the iden- 5 
tified speaker to other communication units 
(12,14,16,18) being used by other users 
(2,4,6,8) participating in the conference call. 

21. Apparatus according to claim 20, wherein the iden- 10 
tified speaker is identified by comparing the speech 
being spoken by the user with a voice profile for that 
user. 



22. Apparatus according to claim 21 , wherein the voice * 5 
profile is determined during the conference call. 

23. Apparatus according to claim 21 , wherein the voice 
profile is determined and stored in advance of the 
conference call. 20 

24. Apparatus according to any of claims 20 to 23, 
wherein for each respective other user communica- 
tion unit (12,14,16,18), the data related to the iden- 
tified speaker comprises data selected from a hier- 25 
archy of data relating to the identified speaker ac- 
cording to a data displaying capability of respective 
other user's communication unit. 

25. Apparatus according to any of claims 20 to 23, fur- 30 
ther comprising means for representing the users 

by autonomous entities that own data about the re- 
spective users. 



26. Apparatus according to claim 25, comprising means 35 
for implementing software agents. 

27. A communication unit (12,14,16,18) for participat- 
ing in a conference call, characterised by: 

40 

means for receiving data related to an identified 
speaker; and 

means for displaying speaker data based on 
the received data related to the identified 
speaker. 45 



28. A communication unit according to claims 27, 
wherein the displayed speaker data comprises in- 
formation provided by the speaker. 

29. A communication unit according to claim 27 or 28, 
wherein the displayed speaker data comprises in- 
formation allocated to the particular speaker by the 
user of the communication unit. 



30. A storage medium storing processor-implementa- 
ble instructions for controlling a processor to carry 
out the method any of claims 1 to 10. 
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